Measuring Conceptual Similarity by Spreading Activation over Wikipedia’s Hyperlink Structure

نویسندگان

  • Stephan Gouws
  • G-J van Rooyen
  • Herman A. Engelbrecht
چکیده

Keyword-matching systems based on simple models of semantic relatedness are inadequate at modelling the ambiguities in natural language text, and cannot reliably address the increasingly complex information needs of users. In this paper we propose novel methods for computing semantic relatedness by spreading activation energy over the hyperlink structure of Wikipedia. We demonstrate that our techniques can approach state-of-the-art performance, while requiring only a fraction of the background data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Similarity between Words Computed by Spreading Activation on an English Dictionary

This paper proposes a method for measuring semantic similarity between words as a new tool for text analysis. The similarity is measured on a semantic network constructed systematically from a subset of the English dictionary, LDOCE (Longman Dictionary of Contemporary English). Spreading activation on the network can directly compute the similarity between any two words in the Longman De ning V...

متن کامل

Semantic Information Retrieval based on Wikipedia Taxonomy

Information retrieval is used to find a subset of relevant documents against a set of documents. Determining semantic similarity between two terms is a crucial problem in Web Mining for such applications as information retrieval systems and recommender systems. Semantic similarity refers to the sameness of two terms based on sameness of their meaning or their semantic contents. Recently many te...

متن کامل

cs224w Visualizing Human Concept Linking Through Wikipedia

Wikipedia is a unique information network of all collective human knowledge together with the relationships between different entities and concepts. Articles range a variety of subjects and are usually contributed to by an active community knowledgeable on that subject. An article’s text can contain inline hyper links to other articles if an editor deems the conceptual connection strong enough ...

متن کامل

Measuring of Semantic Relatedness between Words based on Wikipedia Links

A novel technique of semantic relatedness measurement between words based on link structure of Wikipedia was provided. Only Wikipedia’s link information was used in this method, which avoid researchers from burdensome text processing. During the process of relatedness computation, the positive effects of two-directional Wikipedia’s links and four link types are taken into account. Using a widel...

متن کامل

An attractor model of lexical conceptual processing: simulating semantic priming

An attractor network was trained to compute from word form to semantic representations that were based on subject-generated features. The model was driven largely by higher-order semantic structure. The network simulated two recent experiments that employed items included in its training set (McRae and Boisvert, 1998). In Simulation 1, short stimulus onset asynchrony priming was demonstrated fo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010